Text Data Mining of English Guidebooks Available at Local Airports in Japan

نویسندگان

  • Hiromi Ban
  • Takashi Oyabu
چکیده

Ishikawa Prefecture is located in the Hokuriku region in Japan. One of the main targets of the tourism industry in Ishikawa is to increase the number of tourists from foreign countries. In order to solve this problem, it is necessary to provide foreign tourists with a “language service.” In this study, in order to understand the state of language service provided to foreign tourists, we investigated what linguistic characteristics can be found in English pamphlets at Komatsu Airport and Toyama Airport, which are local airports in Japan, comparing them with pamphlets available at Narita, Kansai, Central Japan, and London Heathrow international airports. In short, frequency characteristics of characterand word-appearance were investigated using a program written in C++. These characteristics were approximated by an exponential function. Furthermore, we calculated the percentage of Japanese junior high school required vocabulary and American basic vocabulary to obtain the difficulty-level as well as the K-characteristic of each material. As a result, it was clearly shown that English pamphlets available at local airports in Japan have a similar tendency to literary writings in the characteristics of character-appearance. Besides, the values of the Kcharacteristic for the pamphlets are high, and the difficulty level is also high, especially in terms of the Japanese required vocabulary.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

ارائه مدلی برای استخراج اطلاعات از مستندات متنی، مبتنی بر متن‌کاوی در حوزه یادگیری الکترونیکی

As computer networks become the backbones of science and economy, enormous quantities documents become available. So, for extracting useful information from textual data, text mining techniques have been used. Text Mining has become an important research area that discoveries unknown information, facts or new hypotheses by automatically extracting information from different written documents. T...

متن کامل

Designing a System for Trend Analysis of Users in Website Surfing in Iran Using Data Mining and Text Mining Algorithms

Background and Aim: As of the entrance of web surfing to the lifestyle of a vast majority of people in the society and the need for a more accurate social and cultural policy making in the field, authors intended to analyze the behavior of the society users in viewing different websites so as to help politicians and practitioners. Methods: Design science research method is used in this research...

متن کامل

A Study of Translation Problems of Tourism Industry Guidebooks: An Error Analysis Perspective

This study was motivated by the researchers’ goal to unfold the quality of the English translations of Persian tourism industry texts and discover the most frequent error patterns the Iranian non-native translators have committed in such texts. Thus, the following research questions were addressed: 1) Are the English versions of Persian tourist guidebooks and multimedia compact discs provided b...

متن کامل

The Discursive Construction of “Native” and “Non-Native” ‎Speaker English Teacher Identities in Japan: A Linguistic ‎Ethnographic Investigation

Recent poststructuralist theories of identity posit identities as being discursively constructed in interactions with society, institutions, and individuals. This study used a Linguistic Ethnographic framework to investigate the discursive identity construction of two English teachers, one ‘non-native’ English speaker, and one ‘native’ English speaker, teaching English in a tertiary institution...

متن کامل

Text Mining Based on Self-Organizing Map Method for Arabic-English Documents

Computer information and retrieval is becoming increasingly sophisticated and is being exploited in more and more spheres of human activity. Many computer applications are developed as information distribution systems, of which the Internet is one of the best known and widely used. With enormous quantities of data in different languages available on the net, it is essential that more efficient ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013